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Abstract: In this paper, we present the design and implementation of the integrated proactive surveillance system for 
prostate cancer (PASS-PC). The integrated PASS-PC is a multi-institutional web-based system aimed at collecting a 
variety of data on prostate cancer patients in a standardized and efficient way. The integrated PASS-PC was 
commissioned by the Prostate Cancer Foundation (PCF) and built through the joint of efforts by a group of experts in 
medical oncology, genetics, pathology, nutrition, and cancer research informatics. Their main goal is facilitating the 
efficient and uniform collection of critical demographic, lifestyle, nutritional, dietary and clinical information to be used 
in developing new strategies in diagnosing, preventing and treating prostate cancer. 

The integrated PASS-PC is designed based on common industry standards - a three tiered architecture and a Service- 
Oriented Architecture (SOA). It utilizes open source software and programming languages such as HTML, PHP, CSS, 
JQuery, Drupal and MySQL. We also use a commercial database management system - Oracle llg. The integrated 
PASS-PC project uses a "confederation model" that encourages participation of any interested center, irrespective of its 
size or location. The integrated PASS-PC utilizes a standardized approach to data collection and reporting, and uses 
extensive validation procedures to prevent entering erroneous data. The integrated PASS-PC controlled vocabulary is 
harmonized with the National Cancer Institute (NCI) Thesaurus. Currently, two cancer centers in the USA are 
participating in the integrated PASS-PC project. 

The final system has three main components: 1. National Prostate Surveillance Network (NPSN) website; 2. NPSN 
myConnect portal; 3. Proactive Surveillance System for Prostate Cancer (PASS-PC). PASS-PC is a cancer Biomedical 
Informatics Grid (caBIG) compatible product. The integrated PASS-PC provides a foundation for collaborative prostate 
cancer research. It has been built to meet the short term goal of gathering prostate cancer related data, but also with the 
prerequisites in place for future evolution into a cancer research informatics platform. In the future this will be vital for 
successful prostate cancer studies, care and treatment. 

Keywords: Cancer research informatics, service-oriented architecture, prostate cancer, proactive surveillance, multi-center 
clinical data database, caBIG. 



INTRODUCTION 

Prostate cancer is a cancer that forms in tissues of the 
prostate (a gland in the male reproductive system found 
below the bladder and in front of the rectum). It is the most 
common cancer affecting men in the United States. More 
than 200,000 new cases are expected to be diagnosed in 
2011 [1]. The majority of diagnosed men will not have 
disease that will result in prostate cancer specific mortality; 
however, nearly 30,000 men will die from prostate cancer 
this year. The introduction of serum prostate specific antigen 
(PSA) screening (around 1990) led to a transient increase in 
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prostate cancer diagnosis. Furthermore, the pattern of initial 
presentation of patients shifted to men with low volume 
disease. The long natural history of this disease has been 
characterized [2] and as a result has raised concerns that 
there may be excessive use of local intervention in men with 
low risk disease. This is accentuated by the recognized 
morbidity of the various forms of local therapy. In 201 1, the 
initial report from the Veterans Administration sponsored 
PIVOT study [3] demonstrated a lack of mortality benefit for 
men with low-risk prostate cancer who underwent surgical 
intervention. Conversely, in ongoing active surveillance 
series, it has been shown that approximately one third of 
men deemed appropriate for active surveillance show 
evidence of progression that merits consideration of 
intervention while the remainder either remain stable and 
eventually terminate follow-up or, due to excessive anxiety, 



1874-4311/12 



2012 Bentham Open 



2 The Open Medical Informatics Journal, 2012, Volume 6 

elect to proceed with therapy despite a lack of evidence of 
progression [4]. 

These data together underscore the need for 
understanding the natural history and biology of low-risk 
disease and the impact of the practice patterns of active 
surveillance on men with low risk disease. Several academic 
institutions have programs of active surveillance in which 
men with low-risk cancers have undergone intense 
observation. Low-risk cancers are typically those which 1) 
are small volume prostate cancers that cannot be felt on a 
prostate examination (digital rectal exam) and 2) lack 
aggressive histological morphology (microscopic 
appearance). These active surveillance routines have been 
institution specific; as such, a more comprehensive active 
surveillance approach is necessary. Active surveillance that 
is accompanied with biospecimen collection represents a key 
need in prostate cancer research and an evolution of this 
process has been coined "pro-active surveillance". 

Currently, there are many open source and commercial 
clinical data management systems available such as Research 
Electronic Data Capture (REDCap) [5], OpenClinica [6], and 
Medidata [7], etc. Here, we give a brief overview of 
REDCap and OpenClinica. 

REDCap is a secure Web application for building and 
managing online surveys and databases. It employs a novel 
workflow methodology and the software solution is designed 
for rapid development and deployment of electronic data 
capture tools to support clinical and translational research 
[2]. It is built using PHP and MySQL. 

OpenClinica is powerful software for collecting and 
managing clinical trial data. It allows you to build your own 
studies, design electronic Case Report Forms (eCRFs), and 
conduct a full range of Electronic Data Capture (EDC) and 
Clinical Data Management (CDM) functions. It is a Web- 
based system and is built using J2EE and Oracle or 
PostgreSQL. 

The integrated PASS-PC is a Web-based distributed, 
heterogeneous clinical data system developed to support the 
research study entitled "Active Surveillance of Prostate 
Cancer" for multi-center clinical sites. 

The study has two objectives: 

Primary Objective 

To carefully observe men (active surveillance) with 
screened detected low risk prostate cancer and manage them 
without immediate curative intervention. 

Secondary Objective 

To explore urine and serum collected in order to develop 
and evaluate new and existing biomarkers for prostate 
cancer, evaluate biomarker changes, study gene expression 
profiles, and evaluate nuclear proteins. 

Web-based data management systems offer great 
potential for facilitating the conduct of large scale or multi- 
center clinical studies [8-11]. Investigators and researchers 
working across multiple sites with varying infrastructure can 
access data and analytical tools in these systems on a real- 
time basis, minimizing the logistical challenges in multi- 
center collaboration, providing improved monitoring 
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capability, and facilitating new mechanisms for producing 
high quality validated data [10]. The integrated PASS-PC is 
a Health Insurance Portability and Accountability Act 
(HIPAA) compliant and caBIG [12] compatible Web-based 
clinical data management system incorporating three main 
components: 

1. National Prostate Surveillance Network (NPSN) 
website - an informational website for proactive 
surveillance of prostate cancer; 

2. NPSN myConnect Portal - A secure patient 
registration web portal; 

3. PASS-PC - Secure study management portal for 
researchers and study coordinators. 

The integrated PASS-PC is based on a legacy stand-alone 
Microsoft Access database developed by John Hopkins 
University in 2006. After investigation of REDCap, 
OpenClinica, and Medidata, etc., we decided to design and 
implement an in-house system to 1) facilitate migration of 
data from the legacy Access database and 2) efficiently 
integrate with the NPSN myConnectdatabase. 

METHODS 

The integrated PASS-PC team elected to use a 
"confederation model" [13], as opposed to traditional data 
repository or network models that assume control of an 
individual center's data. A confederation model assumes that 
each participating site retains all rights to the acquired data 
that can be used by other integrated PASS-PC participants 
only after obtaining required permissions and approved by 
its Institutional Review Board (IRB). It is essential to have a 
standardized approach to data collection and reporting for 
this model to be successful. 

The integrated PASS-PC is designed and implemented 
based on common industry standards - a three tiered 
distributed architecture and a Service-Oriented Architecture 
(SOA). 

Based on the legacy standalone MS Access database 
developed by John Hopkins, the integrated PASS-PC defines 
and establishes the criteria for standardization of collection 
forms and identified research questions that must be 
addressed. Baseline and followup questionnaires and a 
dietary food frequency questionnaire (FFQ) have been 
implemented in the integrated PASS-PC. 

The integrated PASS-PC establishes a core data set of 
information to which all participating centers must 
contribute. The core data elements include the most common 
questions used in clinical, nutritional, and quality of life 
studies. Additionally, the core data set includes the following 
data elements: registering institution, staff member 
performing data entry, and patient identification code. 

Patients will provide information on demographic, 
lifestyle, physical activity, dietary habits, family history, 
male and female relatives' health, and medical history; 
whereas information on diagnostic studies, pathology/ 
staging, treatment, surgeries, and biospeciments can only be 
provided by research coordinator. 

The integrated PASS-PC controlled vocabulary is 
harmonized with NCI Thesaurus. The data elements have 
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been defined based on the NCI Cancer Data Standards 
Registry and Repository (caDSR) [14]. 

First, the data model is built using the open source data 
modeling software - ArgoUML [15], and metadata is created 
based on NCI's controlled vocabulary. Next, the Unified 
Modeling Language (UML) data model is loaded into the 
NCI's caDSR production server. Finally, the data definition 
language (DDL) script for Oracle database is generated from 
the data model. The process flow for the system model 
design is shown in Fig. (1). 

According to step 1, we first need to create the object and 
data models for the underlying PASS-PC database. Here, we 



use ArgoUML 0.28, a free open source application, to create 
the models. The output of this procedure is a UML (XML 
format) file. Screenshots of the partial object and data 
models in ArgoUML are shown in Figs. (2, 3), respectively. 

The PASS-PC web application is developed using 
HTML, PHP, CSS and JQuery with an Oracle llg backend 
database. The Apache web server and Oracle database server 
are running on the Red Hat Linux. 

The NPSN website is developed using PHP and the 
Drupal content management framework, and runs on the Red 
Hat Linux Apache web server. 
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Fig. (1). Process flow for the system model design. 
The algorithmic steps for the process flow are described as: 

Begin 

Step 1: Build the object and data models using UML modeling tool such as ArgoUML or Enterprise Architect [16]. 

Step 2: There are several separate steps in running Semantic Integration Workbench (SIW): 

Step 2.1 (by Model Owner): Review unannotated XML Metadata Interchange (XMI) or UML file built in Step 1. 

Step 2.2 (by Model Owner): Perform XMI or UML roundtrip. 

Step 2.3 (by Model Owner): Run semantic connector. 

If no errors such as invalid data types and "unbounded array" in model found in steps 2.1-2.3, then proceed to step 2.4; otherwise, go to step 
1 to correct the errors and repeat steps 2.1-2.3. 

Step 2.4 (by Vocabulary Reviewer): Send XMI or UML file via email to the NCICB to curate the file. 

Step 2.5 (by Model Owner): Review annotated XMI or UML file. 

Step 2.6 (by Model Owner): Generate default Global Model Exchange (GME) tags. 

Step 2.7 (by Model Owner): GME cleanup. 

If no errors found in steps 2.4-2.7, then proceed to step 3; otherwise, go to step 1 to correct the errors and repeat steps 2.1-2.7. 
Step 3: Run UML Loader by the NCICB to load the approved annotated XMI or UML file into the caDSR. 

End. 
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Fig. (2). Partial object model of the PASS-PC. 



The patient's registration web portal {my Connect) has a 
login interface on the NPSN website and utilizes a MySQL 
backend database. 

The MySQL and Oracle databases are located on the two 
different physical Red Hat Linux database servers at 
CSMC's Enterprise Information Services (EIS) data center. 
These two databases exchange stored patient health 
information (PHI) in an encrypted format via Web Service 
call. In current project's phase I stage, there is only 
unidirectional information flow - from myConnecfs MySQL 
database to PASS-PC's Oracle database via Web Service on 
hourly basis. In project's phase II stage, there will be 
bidirectional information flow between my Connect' s 
MySQL database and PASS-PC's Oracle database via Web 
Service. 

The high level system architecture is shown in Fig. (4). 

Fig. (5) displays the procedure for determining patient's 
eligibility for the study. Fig. (6) shows the procedure of 
PASS-PC connectivity to NPSN and myConnect web portals. 

Cedars-Sinai Medical Center is the Data Coordinating 
Center (DCC) for this Active Surveillance study. Multiple 
clinic sites will be able to simultaneously access and store 
PHI in the integrated PASS-PC. As such, security is the first 
priority in designing the system in order to meet HIP A A 
compliance. 

In keeping with the CSMC's EIS standards, the web 
server is SSL encrypted and all browsers accessing the 



server will use https. This applies to both internal and 
external users. 

User logins to databases associated with integrated 
PASS-PC is via 2-factor authentication utilizing username, 
password and control-ID. 

Authenticated users will be presented with different web 
interfaces based on their respective user role such as 
Program Administrator, Research Coordinator orViewer 
which is assigned by a system administrator at the time of 
user registration. A program administrator can view, edit 
and add PHI records in the PASS-PC database. He/she can 
also add new users to its site. A program administrator can 
generate different types of reports based on various database 
queries and export reports in a format compatible with 
common statistical software packages such as SAS. A 
research coordinator can view, edit and add PHI records in 
the PASS-PC database. He/she can also generate reports 
based on various database queries and export reports in a 
format compatible with common statistical software 
packages such as SAS. A viewer can only view PHI records 
and generate reports. 

All users belong to a clinical site and are restricted to 
accessing PHI associated with that site. 

All PHI is stored in an encrypted format in both PASS- 
PC and myConnect databases. 

Passwords are also encrypted and are set at a minimum 
of 12 characters and must include one number, one upper 
case letter, one lower case letter and one special character. 



The Integrated Proactive Surveillance System for Prostate Cancer 



The Open Medical Informatics Journal, 2012, Volume 6 5 



n annotated.PASS-PC_vl_rvJ.uml - Class Diagram 2 - ArgoUML * 



File Edit View Create Arrange Generation Critique lools Help 

: S5HHM*] jMMiE |HL5 ~ sua 



Order By Type, Name 



H HEALTH_HISTORY_NUMBER 
S ID 
H YEAR 

y FK_ANNUAL_UPDT_LTR_PAT_INFO 

3 PK_ANNUAL_UPDT_LTR 

ATTEMPT_PSA_SUMMER_YEAR 

BASELINE_QA 

BASELINE_QA_2 

BIOPSY 

Fl VE_YE AR_LI FE ST YLE_QA 

FIVE_YEAR_LIFESTYLE_QA2 

HOSPITAL 

IDENTIFICATION 

MASTER 

PATHOLOGY 

PATIENTJNFORMATION 

PATIENT_LOG 

PCA3 

PHONEJ.OG 
PHYSICIAN 

POST_ACTIVE_SURV_FOLLOWUP 

PROSPECTIVE_TRACKING 

PROSPECTIVE_TRACKING_PHONE_LOG 

PSA 

SITE 

TREATMENT 



I ® 

Kirn 



|B|5l|T^[r5ig[n^ 



-•HI . >'iv.. L..(t.i 7 L E 1 1" I f '-I, 



STUDY _ID_NUMBER : VARCHAR2 
LAST_NAME VARCHAR2 
FIRST_NAME : VARCHAR2 
PREFIX VARCHAR2 
SUFFIX : VARCHAR2 

HEALTH_HISTORY_NUMBER VARCHAR2 
CITY : VARCHAR2 
STATE : VARCHAR2 
ZIPCODE : VARCHAR2 
PHONE_NUMBER : VARCHAR2 
CONFIRM ATI ON_BX_DATE : DATE 
DATE_OF_DIAGNOSIS DATE 
BASELINE_BX_DATE DATE 
BASELINE_DATE_BASED_ON : VARCHAR2 
TARGET_DATE_1_YEAR_FOLLOWUP : DATE 
TA R G ET_D ATE_5_Y E A R_F 0 LLOWU P : DATE 
TRACKING_NUMBER : VARCHAR2 
COMMENTS : VARCHAR2 
NOTES : VARCHAR2 
DOB : DATE 

BASELINEJD : NUMBER 
ADDRESSJ VARCHAR2 
ADDRESS_2 : VARCHAR2 



<PK>» PK_IDENTIFICATION(STUDY_ID_NUMBER VARCHAR2) 



LU 



«table>> 

Logical View Data Model: TREATMENT 



TREATMENTJD 4 1 Class TREATMENT 

HEALTH_HISTORY_NUMBER : VARCHAR2 
TREATMENT :VARCHAR2 
TR E ATM E NT_DATE DATE 
RRP_GLEASON : VARCHAR2 
RRP_GRADE VARCHAR2 
SURG_PATH : VARCHAR2 
CURABLE : VARCHAR2 
SV : VARCHAR2 
LN : VARCHAR2 
EPE : VARCHAR2 
MARGINS VARCHAR2 
SURGICAL_PROSTATE_VOLUME : NUMBER 
SITE_RRP_DOC : VARCHAR2 
SITE_RRP_DOC_YN VARCHAR2 
TREATED_OUTOF_SITE : VARCHAR2 
TREATED_OUTOF_SITE_YN : VARCHAR2 
TERTIARY_PATTERN NUMBER 



<PK» PK_TREATM ENT(TREATM ENTJ D NUMBER) 
<FK» FK_TREATMENT_PATIENT_INFORMATION(HEALTH_HISTORY_NU 



As Diagram | 



□ High 
o- C!l Medium 
o- ___ Low 



I ▼ j 60 Items 4 ^ToDoltem A Properties | A Documentation A Presentation A Source A Constraints A Stereotype A Tagged Values A Checklist 



El Class* B < 



Visibility 
* public O 



EfeMb Model (Model Logical View] 
package O protected O private 



Client Dependencies: 
j Supplier Dependencies: 
" | ^ [ 4 | Generalization 
Specialization: 



TREATMENT->Treatment [Model Looici Attributes: 
Operations: 

Association: 

Owned Elements: 



H TREATMENT ID 



y PK TREATMENT 



- FK TREATMENT PATIENT INFORMATIC 





□ Root 


□ Leaf 


□ Abstract 


□ Active 



Template Parameters: ' 



90Mused of1 10M total! 



Fig. (3). Partial data model of the PASS-PC. 



SM§ Sit. 



Legend 






PASS-PC 






< 


► 


NPSN 


<— 


— ► 








Patient Web browser 



Firewall 




Proxy Server 



PASS -PC 

PASS -P(T\ AD Authentication Database 
... , ^ \ Server 
Web Server 



Fig. (4). The high level system architecture of the integrated PASS-PC. 



All users will be forced to change their passwords on initial 
login and every six months thereafter. 

All users will be automatically logged out after 10 
minutes of inactivity. 



RESULTS 

The integrated PASS-PC is implemented as a cancer 
research informatics data repository to support the proactive 
surveillance for prostate cancer study. Fig. (7) presents the 
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screenshot of the query interface for generating various 
reports which can be downloaded and imported into standard 
statistics software. 
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Fig. (5). The procedure for determining patient's elgibilty for the 
study. 1. Patient calculates eligibility. 2. Site returns results and 
next steps. 3. Patient meets with physician for initial consultation. 
4. Physician validates that patient is eligible. 

The integrated PASS-PC uses a two-factor authentication 
methodology to prevent unauthorized access to the system. 
User of the system needs to provide the correct username 
and password to pass the first layer of authentication, and 
then he/she has to provide the correct control id to pass the 
second layer of authentication. The integrated PASS-PC 
maintains an audit trail of all data entries and user activities 
to protect the authenticity, integrity and confidentiality of all 
data entries. 



Fig. (6). The procedure of PASS-PC connectivity to NPSN and 
myConnect web portals. Coordinator logs into myConnect web 
portal and creates username, temporary password and control id. 5. 
Coordinator provides username, password and control id to patient. 
6. Patient signs into myConnect web portal and completes online 
enrollment forms. 7. Baseline data sent to PASS-PC database when 
patient completesenrollment. 8. Coordinator receives auto email 
upon patient enrollment submission. 9. Coordinator logs into 
PASS-PC to access patient data (can only access own site data). 

An entry is inserted into USER_AUDIT table whenever 
the following actions occur: 

a. User logs in to the PASS-PC database 

b. User logs out of the PASS-PC database 
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Fig. (7). Query interface for generating report in PASS-PC. 
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Variables included in entry of USER_AUDIT table are 
presented in Table 1. 

Table 1. Definition of Variables in the USER AUDIT Table 



Variable Name 


Definition 


USER ID 


Specific identification number of the database User 


LOGIN TIME 


The system date and time when the user logged in 
and began a session 


LOGOUTTIME 


The system date and time when the user logged out 
and ended a session 


IP ADDRESS 


The IP address of the computer where the user 
accesses the database 



An entry is inserted into OPERATIONAUDIT table 
whenever the following actions occur: 

a. User views a specific record 

b. User edits a specific record 

c. User creates a new record 

Variables included in entry of OPERATION AUDIT 
table are presented in Table 2. 

Table 2. Definition of Variables in the OPERATION AUDIT 
Table 



Variable Name 


Definition 


USER ID 


Specific identification number of the database 
User 


TIME STAMP 


The system date and time that the action was 
performed 


TABLE 


The specific table name where the action 
occurred 


COLUMN 


The specific table column name where the 
action occurred 


OPERATION TYPE 


The specific action that was performed by the 
user 


OLD VALUE 


The original value that existed prior to the new 
action 


NEW VALUE 


The new value that exists subsequent to the 
action 



Currently, there are about 800 prior patients in John 
Hopkins site needed to be re-consented to transfer their 
legacy data into the integrated PASS-PC. Currently, Cedars- 
Sinai Medical Center and John Hopkins Hospital are two 
participating sites. As a cancer research informatics 
computing platform for proactive surveillance for prostate 
cancer study, the integrated PASS-PC would lead to a better 
understanding and treatment of men with low risk prostate 
cancer. 

The integrated PASS-PC is a multi-disciplinary project. 
Its success is based on the collaborative efforts of multiple 
individuals and teams with expertise in medical oncology, 
genetics, pathology, prostate cancer, nutrition, and 
information technology. 
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The NPSN website and myConnect web portal can be 
accessed at: http://www.npsn.net:443. 

The PASS-PC can be accessed by participating clinic 
sites at: http://www.passpc.csmc.edu:443. 

DISCUSSION 

The integrated PASS-PC project, as a unique web-based, 
caBIG compatible and multi- center clinical data 
management system, has high reliability and extensibility. 
The integrated PASS-PC offers a number of benefits to its 
users, including: i) standardized data elements, vocabulary 
and forms for data collection; ii) high quality data validation 
for data entry; iii) highest level network and data security by 
firewall, vpn, two factor authentication, data encryption and 
audit trail; iv) generating various reports in spreadsheet 
format based on dynamic queries of data elements. 

PCCR [13], a pancreatic cancer collaborative registry, is 
a multi- institutional Web-based system aimed to collect a 
variety of data on pancreatic cancer patients and high-risk 
subjects in a standard and efficient way. PCCR also utilized 
a "confederation model" as its architecture. Similar to the 
integrated PASS-PC, PCCR also adopted standardized data 
element, controlled vocabulary and forms for data collection. 
PCCR's controlled vocabulary is in harmonization with NCI 
Thesaurus, as the same resource used by the integrated 
PASS-PC. Both systems follow the NCI caBIG compatible 
standards. The differences between the integrated PASS-PC 
and PCCR are: i) the integrated PASS-PC is developed in 
HTML, PHP, CSS, JQuery, Drupal, MySQL and Oracle 1 lg. 
PCCR is developed in Java/JSP and Oracle lOg; ii) the 
integrated PASS-PC consists of three components: NPSN 
website, NPSN myConnect portal and PASS-PC. It is based 
on the three-tier and service-oriented architecture. NPSN 
myConnect portal communicates with the PASS-PC through 
web service calls on an hourly basis. PCCR is a centralized 
registry. 

BCCR [17], a breast cancer collaborative registry, is a 
multicenter Web-based system aimed to collect and manage 
a variety of data on breast cancer patients and breast cancer 
survivors. BCCR is developed by following the same 
methodology as PCCR, so it is similar to the integrated 
PASS-PC except that the differences we listed above 
between the integrated PASS-PC and PCCR. 

Kong MY, et al. [18] described an ontology-based 
framework for clinical research databases. The Ontology- 
Based extensible data model (OBX) was developed to serve 
as a framework for clinical research data in the Immunology 
Database and Analysis Portal (ImmPort). Similar to the 
integrated PASS-PC, OBX is a relatively simple conceptual 
model. The difference between two systems is that the 
integrated PASS-PC is a specialized system for proactive 
surveillance for prostate cancer study; OBX is a general data 
model for an immunology database and analysis portal. 

In the market, REDCap, OpenClinica and Medidata are 
good clinical data management systems. But these systems 
are not suitable for this project because we need to migrate 
the legacy data stored in the MS Access database and the in- 
house software provides greater flexibility, control, 
extensibility and range of import/export options. 
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The integrated PASS-PC has a user- friendly graphical 
user interface (GUI) to help access the patient health 
information in the database. The security is first priority in 
designing and implementing the system. The current system 
also has some limitations. For example, it does not support 
the storage and retrieval of biospecimen images, and it does 
not integrate with other EMR systems currently running at 
Cedars-Sinai Medical Center. 

CONCLUSIONS 

The integrated PASS-PC project utilizes open source 
software and industry standards-based technologies in order 
to design, develop and deploy an extensible and integrative 
clinical data management platform. The design of the system 
follows the NCI's caBIG paradigm to facilitate the 
integration among heterogeneous data systems and 
information sharing among these data systems. 

The design principles of the integrated PASS-PC are 
generalized and should be informative to analogous efforts 
and programs. 

Currently, the project is in phase I stage -unidirectional 
information flow from NPSN myConnect database to the 
PASS-PC database. Phase II is in the planning stage to 
implement bidirectional information flow between 
myConnect database and the PASS-PC database. 
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